AITopics | bloom filter

Collaborating Authors

bloom filter

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Practical Near Neighbor Search via Group Testing: Supplementary Materials

Neural Information Processing SystemsApr-25-2026, 22:18:07 GMT

In this section, we provide proofs for all of the theorems introduced in the main text. We begin with a simple extension of the results of [3] for the Bloom filter false positive and negative rates. Then, we prove our main claim, which is that the query time of our data structure is sublinear, given some relatively weak assumptions on the stability of the query. Theorem 1. Assuming the existence of an LSH family with collision probability s(x,y) = sim(x,y), the distance-sensitive Bloom filter solves the approximate membership query problem with p 1 exp 2m t/m+ SLH We begin with a brief explanation of the results from [3]. Recall that a distance-sensitive Bloom filter is a collection of mbit arrays. Array iis indexed using an independent LSH function li(x). To insert a point xinto the ith array, we set the bit at location li(x) to '1.' To query the filter, we calculate the mhash values of the query and return "true" when at least tof the corresponding bits are '1.' To bound p (the true positive rate) and q (the false positive rate), we bound the probability that a single array returns "true."

artificial intelligence, equation, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

A Model for Learned Bloom Filters and Optimizing by Sandwiching

Neural Information Processing SystemsMar-16-2026, 17:59:25 GMT

Recent work has suggested enhancing Bloom filters by using a pre-filter, based on applying machine learning to determine a function that models the data set the Bloom filter is meant to represent. Here we model such learned Bloom filters, with the following outcomes: (1) we clarify what guarantees can and cannot be associated with such a structure; (2) we show how to estimate what size the learning function must obtain in order to obtain improved performance; (3) we provide a simple method, sandwiching, for optimizing learned Bloom filters; and (4) we propose a design and analysis approach for a learned Bloomier filter, based on our modeling approach.

artificial intelligence, machine learning, proceedings, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.81)

Add feedback

Robust Bloom Filters for Large MultiLabel Classification Tasks

Moustapha M. Cisse, Nicolas Usunier, Thierry Artières, Patrick Gallinari

Neural Information Processing SystemsFeb-18-2026, 21:26:06 GMT

This paper presents an approach to multilabel classification (MLC) with a large number of labels.

artificial intelligence, bloom filter, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback

Fast Partitioned Learned Bloom Filter

Neural Information Processing SystemsFeb-15-2026, 08:30:49 GMT

One such filter, the partitioned learned Bloom filter (PLBF), achieves excellent memory efficiency.

artificial intelligence, machine learning, plbf, (17 more...)

Neural Information Processing Systems

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.39)

Add feedback

b0ab42fcb7133122b38521d13da7120b-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 17:59:10 GMT

co 0, compression, gradient, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
South America > Brazil > São Paulo (0.04)
North America > United States > Oregon (0.04)
(4 more...)

Industry: Information Technology (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

84b744165a0597360caad96b06e69313-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 10:14:45 GMT

dataset, fedsim, similarity, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Germany (0.14)
Asia > Singapore (0.05)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Banking & Finance (1.00)
Information Technology > Security & Privacy (0.46)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Data Portraits: Recording Foundation Model Training Data

Neural Information Processing SystemsFeb-9-2026, 18:05:37 GMT

Foundation models are trained on increasingly immense and opaque datasets.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Dominican Republic (0.04)
North America > Canada > Ontario > Toronto (0.04)
(6 more...)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

86b94dae7c6517ec1ac767fd2c136580-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 06:21:44 GMT

ada-bf, bloom filter, fpr, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Harris County > Houston (0.04)
Oceania > New Zealand (0.04)
Oceania > Australia (0.04)
(2 more...)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (0.69)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Architecture > Real Time Systems (1.00)

Add feedback

5248e5118c84beea359b6ea385393661-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 16:36:53 GMT

dataset, equation, query, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Texas > Harris County > Houston (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

gHAWK: Local and Global Structure Encoding for Scalable Training of Graph Neural Networks on Knowledge Graphs

Sabir, Humera, Farooq, Fatima, Aboulnaga, Ashraf

arXiv.org Artificial IntelligenceDec-10-2025

Knowledge Graphs (KGs) are a rich source of structured, heterogeneous data, powering a wide range of applications. A common approach to leverage this data is to train a graph neural network (GNN) on the KG. However, existing message-passing GNNs struggle to scale to large KGs because they rely on the iterative message passing process to learn the graph structure, which is inefficient, especially under mini-batch training, where a node sees only a partial view of its neighborhood. In this paper, we address this problem and present gHAWK, a novel and scalable GNN training framework for large KGs. The key idea is to precompute structural features for each node that capture its local and global structure before GNN training even begins. Specifically, gHAWK introduces a preprocessing step that computes: (a)~Bloom filters to compactly encode local neighborhood structure, and (b)~TransE embeddings to represent each node's global position in the graph. These features are then fused with any domain-specific features (e.g., text embeddings), producing a node feature vector that can be incorporated into any GNN technique. By augmenting message-passing training with structural priors, gHAWK significantly reduces memory usage, accelerates convergence, and improves model accuracy. Extensive experiments on large datasets from the Open Graph Benchmark (OGB) demonstrate that gHAWK achieves state-of-the-art accuracy and lower training time on both node property prediction and link prediction tasks, topping the OGB leaderboard for three graphs.

artificial intelligence, machine learning, node, (19 more...)

arXiv.org Artificial Intelligence

2512.08274

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Technology: